Position and Trajector Learning for Microphone Arrays
نویسندگان
چکیده
In this paper we tackle the problem of source localization by example. We present a methodology that allows a user to train a microphone array system using signals from a set of positions and trajectories and subsequently recall the localization information when presented with new input signals. To do so we present a new statistical model which is capable of accurately describing features from the cross spectra of the microphone signals so as to model the room responses from all positions of interest. We further extend this model to allow modeling of sequences of positions, thereby also enabling the learning and recognition of trajectories. Because of its learning nature this method provides practical advantages in setting up a microphone array, by not requiring favorable room acoustics, careful element positioning or uniformity of sensors. It also introduces an approach to localization which can be extended to other problems requiring models of transfer functions. We present tests on synthetic and real-world data and present the resulting recognition rates for a variety of situations. IEEE Transactions on Audio, Speech and Language Processing This work may not be copied or reproduced in whole or in part for any commercial purpose. Permission to copy in whole or in part without payment of fee is granted for nonprofit educational and research purposes provided that all such whole or partial copies include the following: a notice that such copying is by permission of Mitsubishi Electric Research Laboratories, Inc.; an acknowledgment of the authors and individual contributions to the work; and all applicable portions of the copyright notice. Copying, reproduction, or republishing for any other purpose shall require a license with payment of fee to Mitsubishi Electric Research Laboratories, Inc. All rights reserved. Copyright c ©Mitsubishi Electric Research Laboratories, Inc., 2007 201 Broadway, Cambridge, Massachusetts 02139
منابع مشابه
شکلدهی وفقی و هوشمند پرتو در آرایههای میکروفونی Ad-hoc با استفاده از خوشهبندی و رتبهبندی میکروفونها
Considering the existence of a many speech degradation factors, speech enhancement has become an important topic in the field of speech processing. Beamforming is one of the well-known methods for improving the speech quality that is conventionally applied using regular (classical) microphone arrays. Due to the restrictions in the regular arrangement of microphones, in recent years there has be...
متن کاملEstimation of Talker's Head Orientation Based on Discrimination of the Shape of Cross-power Spectrum Phase Coefficients
This paper presents a talker’s head orientation estimation method using 2-channel microphones. In recent research, some approaches based on a network of microphone arrays have been proposed in order to estimate the talker’s head orientation. In those methods, the talker’s head orientation is estimated using the sound amplitude or peak value of CSP (Cross-power Spectrum Phase) coefficients obtai...
متن کاملDesign of broadband beamformers robust against microphone position errors
Fixed broadband beamformers using small-sized microphone arrays are known to be highly sensitive to errors in the microphone array characteristics. This paper describes a procedure for designing broadband beamformers with an arbitrary spatial directivity pattern, which are robust against errors in the microphone positions. The presented design procedure optimises the mean performance of the bro...
متن کاملSpaciousness Rating of 8-channel Stereophony-Based Microphone Arrays
In previous studies, the localisation accuracy and the spatial impression of 3-2 stereo microphone arrays were discussed. These showed that 3-2 stereo cannot produce stable images to the side and to the rear of the listener. An octagon loudspeaker array was therefore proposed. Microphone array design for this loudspeaker configuration was studied in terms of localisation accuracy, locatedness a...
متن کاملAnalysis of the average performance of the multi-channel Wiener filter for distributed microphone arrays using statistical room acoustics
For most multi-microphone noise reduction algorithms, e.g. the multi-channel Wiener filter (MWF), it is well known that the performance depends on the acoustic scenario at hand, i.e. the used microphone array, the position of the desired source and the noise field. Since the position of the desired source is not always known a priori, it is of great interest in many applications to be able to c...
متن کامل